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VIDEO CODING METHOD AND DECODCNG 
METHOD AND DEVICES THEREOF 

BACKGROUND OF THE INVENTION 

1. Field of the Inveatioa 

This invention can be used in low bic race video coding for 
tele -communicative applications. It improves the Temporal 
frame race of the decoder ourpuc as -.veil as the overall ?ic:ure 
qualir/. »c 

2. Related an of the Invention 

In a typical hybrid transform coding algorithm such as the 
rXU-T Recommendation H.261 (1| and MPEG [21 motion 
compensation is used to reduce the amount of temporal 
redundancy in me sequence. In the H.261 coding scheme. - 
the frames are coded using only forward prediction, here- 
after referred to as P-frames. In the MPEG coding scheme, 
some frames are coded using bi-direcrion prediction, here- 
after referred to as B-framcs. B-frames improve the e*n- 
ciencv of the coding scheme. No w the [U is rTUT Rec- :c 
oramendntion H.261 (Formerly CCTTT Recommeadation 
H.261; Codes for audiovisual services at p:<64 scbit/s 
Geneva. 1990 . and the |2| is ISO/EC 11172-2 1993 . 
Information technology — Coding of moving pictures and 

Q ' - associated audio for digital storage media at up to about 13 25 

Jjg Mbit/s - Pan 2: Video. 

^ However, it introduces delay in the encoding and 

c decoding, making it unsuitable for applications in the com- 

municative services where delay is an important parameter. ^ 
H* FIG. la and lb illustrates the frame prediction of H.261 and 30 

*g MPEG as described above. A new method of coding invoiv- 

ing the coding of the P and B frames as a single unit. 
i hereafter referred to as the PB-frame. was introduced. In this 

CS scheme the blocks in the PB-frames are coded and trans- 

s mined together thus reducing the total delay. In fact the total 35 

y_ achy should not be more than a scheme using forward 

~T prediction only but at half the frame rate. 

P FIG. 2a shows the PB-frame prediction. A PB-frame 

H= consists of rwo piaures being coded as one unit. The name ^ 

gg PB comes from the name of picture types in MPEG where 

there are P-frames and B-frames. Thus a PB-frame consists 
of one P- frame which is predicted from the last decoded 
P- frame and one B- frame which is predicted both from the 
last decoded P-frame aad the P-frame curready being ^. 
decoded This last picture is called B-frame because pans of 
it may be bi-directionally predicted from the past and future 
P-frame. 

FTG. 2b shows the forward and bi-directional prediction 
for a block in the B-frame. hereafter referred co as a 3-block. i0 
Only the region that overlaps with the corresponding block 
in the current P-frame. hereafter referred to as :hc P-bicck. 
is bi-directionally predicted Tne rest of the B-block is 
forward predicted from the previous frame. Thus only the 
previous frame is required in the frame store. The informa- j5 
qoq from the P-frame is obtained from the P -block currently 
being decoded. 

In the PB-block only the motion vectors for -he P-block 
is transmitted co the decoder. Tne forward and backward 
motion vectors for the B-clock is derived from the P motion 
vectors. A linear motion model is used and the cemp^ral 
referer.ee of the 3 and ? frame is used to scale ±c motion 
vector appropriately. FIG. 5*** depicts the motion vector 
scaling and the formula is shown below. 
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where 

NfV is the motion vector of the P-block. 
\fV^ and MV 5 are the forward and backward motion 
3 vectors for the B-block. 

TR B is the increment in the temporal reference from the 

last P- frame co the current B-frame. and 
TR P is the increment in the temporal reference from the 
last P- frame co the current P-frime. 
10 Currently the method used in the prior an assumes a linear 
. mocioa model. However this assumption is aot valid in a 
normal scene where the motion is typically not linear. This 
is especially (rue when the camera shakes and when objects 
are not moving at constant velocities. 
13 A second problem involves the quantization and trans- 
mission of the residual of the prediction error in the B-block 
CurrcQtly the coefficients from the P-block and the B-block 
are interleaved in some scanning order which requires the 
B-block efficients to be transmitted even when they are all 
20 zero. This is not very efficient as it is quite often that there 
are no residual coefficients to transmit (all coefficients arc 
zero). 

SUMMARY OF THE E^VENTION 

25 In order to solve the first problem, the current invention 
employs a delta motion vector to compensate for the non- 
linear motion. Thus it becomes necessary for the encoder to 
perform an additional motion search to obtain the optimum 
delta motion vector that when added to the derived motion 

30 vectors would result in the best match in the prediction This 
delta motion vectors are transmitted to the decoder at the 
block level only when necessary. A flag is used to indicate 
to the decoder if there are delta motion vectors present for 
the B-block. 

35 For the second problem, mis invention also uses a flag to 
indicate if there arc coefficients for the B-block to be 
decoded. 

The operation of the Invention is described as follows. 
40 FIG. 3a shows the Linear motion model used for the 
derivation of the forward and backward motion vectors from 
the P-block motion vector and the temporal reference infor- 
mation As illustrated in FIG. 3b. this model breaks down 
when the motion is not linear. The derived forward and 
4,5 backward motion vector is different from the actual motion 
vector when the motion is aot linear. This is especially true 
when objects in the scene are moving at changing velocities. 

In the current invention the problem is solved by adding 
a small delta motion vector to the derived motion vector to 
50 compensate for the difference berweea the derived and true 
motioa vector. Therefore the equations in (I) and (2) are now 
replaced by equations (3) and (4). respectively. 

\iV 9 ^(TR 9 ~TR,#.UWTR,-MV l> ^ 

where 

MV is the motion vector of the P-block. 
MV 0W(0 is the delta motion vector. 
i\fvy and NfVV arc the new forward and backward 
motion vectors for the B-block according to the current 
invention. 

TR 5 is the increment in the temporal reference from the 

. last P- frame to the current B- frame, and 
TR^ is the increment in the temporal reference from the 
last P- frame to the current P- frame. 



Note: Equations (3) aad (4) are used for the motion vector 
in the horizontal as well as the vertical directions. Thus 
the motion vectors are in pairs and there are actually two 
independent delta motion vectors, one each for the hori- 
zontal and vertical directions. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. la is a prior art which illustrates the prediction mode 
used in the ITU-T Recommendation H.26I Standard. 

FIG. 16 is a prior an which illustrates the prediction mode 
used in the ISO-IEC/JTC MPEG Standard. 

FIG. 2a illustrates the PB-frarae prediction mode. 

FIG. 2b illustrates the B-block bi-directional prediction 
mode. 

FIG. 3a illustrates the linear motioa model. 

FIG. 3b illustrates the non-linear motion model of the 
current invention 

FIG- 4 illustrates the encoder functionality block diagram. 

FIG- 5 illustrates the B- block bi-directional prediction 
functionality block diagram. 

FIG. 6 illustrates the decoder functionaltity block dia- 
gram. 

PREFERRED EMBODIMENTS 

The preferred embodiment of the current invention is 
described here. FIG. 4 illustrates the encoding functionality 
diagram. The present invention deals with the method for 
deriving the motioa vectors for the B-block. The encoding 
functionality is presented here for completeness of the 
embodiment. 

The encoding functionality block diagram depicts an 
encoder using a motion estimation and compensation for 
reducing the temporal redundancy in the sequence to be 
coded. The input sequences is organized into a first frame 
and pairs of subsequent frames. The first frame, hereafter 
referred to as the I- frame, is coded independent of all other 
frames. The pain of subsequent frames, hereafter referred to 
as PB-frame. consist of a B- frame followed by a P-frame. 
The P-frame is forward predicted based on the previously 
reconstructed I- frame or P-frame and the B- frame is 
bi-directionally predicted based on the previously recon- 
structed I- frame or P-frame and the information in the 
current P-frame. 

The input frame image sequence. 1. is placed in the Frame 
Memory 2. If the frame is classified as an I-frame or a 
P-frame it is passed through line 14 to the Reference 
Memory 3. for use as the reference frame in the motion 
estimation of the aext PB-frame to be predictivciy encoded. 
The signal is then passed through line 13 to the Block 
Sampling module 4. where it is partitioned into spatially 
noQ-overlapping blocks of pixel data for further processing. 

If the frame is classified as an I-frame. the sampled blocks 
are passed through Line 16 co the DCT module 7. If the frame 
is classified as a PB-frame. the sampled blocks are passed 
through Uqc 17 to the Motion Estimation module 5. The 
Motion Estimation module 5 uses information from the 
Refereace Frame Memory 3 and the curreat block 17 to 
obtain the motion vector for that provides the best match for 
the P-block The motion vector and the local reconstructed 
frame. 12. arc passed through line 19 and 20. respectively, 
to the Motion Compensation module 6. The difference 
image is formed by subtracting the motion compensated 
decoded frame. 21. from the current P-block. 15. This signal 
is then passed through line 22 to the DCT module 7. 




In the DCT module 7. each block is transformed into the 
DCT domain coefficients. The transform coefficients are 
passed through line 23 to Quantization module S. where they 
are quantized. The quantized coefficents are then passed 
5 through line 24 to the Run-length & Variable Length Coding 
module 9. Here the coefficients are entropy coded to form 
the Output Bit Stream. 25. 

If the current block is an I-block or a P-block. the 
quantized coefficients are also passed through line 26 to the 
to Inverse Quantization module 10. The output of the Inverse 
Quantization 10. is then passed through Ike 27 to the 
Inverse DCT module U. If the current block is an I-block 
Chen the reconstructed block is placed, via line 23. in the 
Local Decoded Frame Memory 12. If the current block is a 
P-block then the output of the Inverse DCT 29 is added to 
Che mo do a compensated output 21. to from the recon- 
structed block 30. The reconstructed block 30. is then placed 
in the Local Decoded Frame Memory 12. for the motion 
20 compensation of the subsequent frames. 

After the P-block have been locally reconstructed, the 
information is passed again to the Motion Compensation . 
Module 6. where the prediction of the B-biock is formed. 
FIG. 5 shows a more detailed functional diagram for the 
B-block prediction process. The P-motion vector derived in 
' the Motion Estimation module 51. is passed through line 57 
to the Motion Vector Scaling Module 53. Here the forward 
and backward motion vectors of the B-block is derived using 
30 the formula (1) and (2). respectively. In the present 
embodiment, an additional motion search around these 
. vectors is performed in the Delta Motion Search module 54. 
to obtain the delta motion vector. In this embodiment the 
motion vector is obtained by performing Che search for all 
delta motion vector values between -3 and 3. The delta 
motion vector value that gives the best prediction in terms of 
the smallest mean absolute difference in the pixel values of 
the B-block and the prediction block is chosen. The predic- 
40 tion is formed in the Bi-directional Motion Compensation 
module 55. according to FIG. 2b using the information from 
the Local Decoded Frame Memory 52. and the Current 
Reconstructed P-block 50. In the bi-directional prediction, 
only information available in the corresponding P-block is 
45 used to predict the B-block. The average of the P-block 
information and the information from the Local Decoded 
Frame is used to predict the B-biock. The rest of the B-block 
is predicted using information from the Local Decoded 
50 Frame only. 

The prediction difference block is then passed through 
line 22 to the DCT module 7. The DCT coefficients are then 
passed through line 23 to the Quantization module S. The 
result of the Quantization module 8. is passed through line 
53 24 to the Run-length A Variable Length Coding 9. In this 
module the preseace of the delta motion vector and the 
quantized residual error in the Ourpuf Bitstreara 25. is 
indicated a variable leagth code. NOB which is the acronym 
for No B-block This flag is generated in Run -length & 
60 Variable Length Coding module 9 based on whether there 
arc residual error in the Quantization module S and delta 
motion vectors found to the Delta Motion Search module 54 
is not zero. Table* I provides the preferred embodiment of the 
variable leogth code for the NOB flag. The variable length 
65 code of the NOB dag is inserted in the Output Bitstreaxn. 25. 
prior to the delta motion vector and quantized residual error 
codes. 



TABLE i 



(Variable !eqgih code for the MOB rtag* 

Quantized Residual Delta Mo cicn 



NOB Error Coded Vectors Coded 



0 No No 

10 No Yes 

U0 Yes No 

111 Yes Yes 10 



FIG. 6 shows the fractional block diagram for the 
decoder. The Input Bit Scream 31. is passed to the Variable 
Length & Run Length Decoding module 32. The block and ^ 
side information arc extracted in this module. If the frame is 
a PB-frame then the bitstream is checked if any delta motioa 
vector and/or quantized residual error coefficients present. 
The output of the module 32is passed through line 37 to the 
Inverse Quantisation module 33. The output of the Inverse ^ 
• Quantization 33. is then passed through line 3S to the 
Inverse DCT module 34. Here the coefficients are trans- 
formed back into the pixel values. 

If the current" frame is an I-frame then the output of 
Inverse DCT 34. is passed through line 39 and stored in the ^ 
" Frame Memory 42. 

If the current frame is a PB-frame. the side information 
containing the motion vectors are passed through line 45 to 
the Motion compensation module 34. The motion Compen- 
sation module 36. uses this information and the information ^ 
in the Local Decoded Memory. 35. to from the motion 
compensated signal. 44. This signal is then added to the 
output of the Inverse DCT module 34. to form the recon- 
struction of the P-biock. 

The Motioa Compensation module 36. then uses the 35 
additional information obtained in the reconstructed P-block 
to obtain the bi-directional prediction for the B-block. The 
B-block is then reconstructed and placed in the Frame 
Memory. 42. together with the P-biock. 

By implementing this invention, the temporal frame rate 40 
of the decoded sequences can be effectively doubled at a 
fraction of the expected cost in bit rate. The delay is similar 
to that of the same sequence decoded at half the frame rate. 

As described above in the present invention a new pre- 
dictive coding is used to increase the temporal frame rate 45 
and coding efficiency without introducing excessive delay. 
Currently the motion vector for the blocks m the 
bi-directionally predicted frame is derived from the motion 
vector of the corresponding block in the forward predicted 
frame using a linear motion model. This however is not 50 
effective when the motion in the image sequence is not 
linear According to this invention, the efficiency or this 
method can be further improved if a non-linear motion 
model is used. In this model a delta motion vector is added 
to or subtracted from the derived forward and backward 55 
motion vector, respectively. The encoder performs an addi- 
tional search Co determine if there is a need tor the delta 
motion vector. The presence of this delta motion vector in . 
che transmitted bitstreom is signalled to the decoder which 
then takes the appropriate action to make use ot the delta W 
motion vector to derive the effective forward and backward . 
motion vectors for the bi-directionally predicted b!oc£ 
^ What is claimed: 

d 1. A method for encoding a sequence ot video image 
frames comprising the steps of: 65 
dividing a source sequence into a set of group of pictures, 
each group of pictures comprising a first trame 



